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^^ The goal of this article is to give an algebraic characterisation of the ab- 

stract syntax of functional programming languages, equipped with reduction 
rules. 

We introduce a notion of 2-signature: such a signature specifies not only 

the terms of a language, but also reduction rules on those terms. 

^> To any 2-signature S we associate a category of "models" of S, and we 

i— i prove that this category has an initial object. The initial object deserves the 

name syntax associated to S, and it is equipped with reductions as specified 

h-j by S. 

c/2 Thus we obtain a characterisation of abstract syntax with reduction rules 

via a universal property. By construction of the category in question, its 

initial object is automatically equipped with a substitution operation that is 

^ compatible with reduction in a suitable sense. 

CS Initiality yields a category-theoretic iteration operator which allows to 

specify reduction-preserving maps, i.e. translations, on the syntax. 
ly-j The initiality theorem is formalized in the proof assistant Coq, yielding 

^1 a machinery which, when fed with a 2-signature, provides the associated 

(^ syntax, certified substitution and the iteration operator. 
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Introduction 

Initial Semantics characterizes the terms of a language associated to a signature S as 
the initial object in some category — whose objects we call Semantics of S — , yielding 
a concise definition of the abstract syntax associated to S 1 . 

(Programming) languages come with varying features, such as variable binding, typing 
or reduction rules. Integration of those features into the initiality result — and thus into 
the syntax — can be done by defining a suitable category of Semantics. 

We present an initiality result for untyped syntax equipped with reduction rules. The 
prime example, which we use as a running example throughout the paper, is given by 
the untyped lambda calculus equipped with beta reduction. 

A language is specified in two steps: at first, we specify the terms of the language by 
means of a 1-signature, say, S. Afterwards, we give reduction rules on the terms of the 
syntax associated to S using inequations over S. 

We call 2~signature any pair (S, A) consisting of a 1-signature S and a set A of 
inequations over S. Our main theorem states that under some mild condition on the 
inequations of A, the category of representations of (S, A) has an initial object. This 
initial object can be considered as the abstract syntax associated to S with reductions 
according to the inequations of A. 

The theorem has been fully certified in the proof assistant Coq. The Coq theory files 
as well as online documentation are available on the author's web page 2 . 

Our approach carries over to simply-typed syntax. However, for sake of simplicity, we 
restrict ourselves to untyped syntax here and explain the integration of types in another 
paper (cf. last section). 

Summary 

We characterise programming languages equipped with reduction rules as initial object 
in some category, thus giving an algebraic definition of abstract syntax with semantics. 
This characterisation of syntax with reduction rules is given in two steps. 

1. At first pure syntax is characterised as initial object in some category. Here we use 
the term "pure" to express the fact that no semantic aspects such as reductions 
on terms are considered. This characterisation is actually given in [HM07b]. 

2. Afterwards we consider inequations specifying reduction rules. Given a set of re- 
duction rules for terms, we build up on the preceding result to give an algebraic 
characterisation of syntax with reduction. 



1 Note that the word "semantics" is used with two different meanings in this work: on the one hand, the 
objects of the category of "models" of a signature are usually called Semantics, and thus the syntax 
associated to the signature becomes the Initial Semantics. On the other hand we use semantics 
to denote "meaning of programs" in form of reduction rules. We use capitalization to distinguish 
between the two uses. 

2 http://math.unice.fr/~ahrens 



In summary, the merit of this work is to give an algebraic characterisation of syntax 
with reduction rules, building up on such a characterisation for pure syntax given by 
Hirschowitz and Maggesi [HM07b]. 

Our approach is based on relative monads 3 [ACU10] from the category Set of sets to 
the category Ord of preorders and modules over such monads. Compared to traditional 
endomonads, monads allow for different categories as domain and codomain. The notion 
of modules over monads generalizes monadic substitution. 

We now explain the above two points in more detail: 

Pure Syntax 

An arity is a list of natural numbers, specifying the number and binding behaviour of a 
constructor, and a 1-signature is a family of arities. To any 1-signature S we associate 
a category of representations — or "models" — of S. 

This category has an initial object (cf. Lem. 39), which deserves the name (pure) 
syntax associated to S. As mentioned above, we use the term "pure" to distinguish 
this initial object from the initial object associated to a 2-signature, which gives an 
analoguous characterisation of syntax with reduction rules (cf. below). 

Initiality for pure syntax is actually a consequence of an initiality theorem proved 
by Hirschowitz and Maggesi [HM07b]: we exhibit a pair of adjoint functors (cf. Lem. 
38) between our category Rep (S) of representations of S and that of Hirschowitz and 
Maggesi, named Rep (S), 



Rep A (S) -I Rep(S) . (0.1) 

To prove the lemma, we then use the fact that left adjoints are cocontinuous, thus 
the image under the functor A of the initial object in the category Rep(S) is initial in 
Rep A (5). 

Syntax with Reduction Rules 

Given a 1-signature S, an S -inequation E = (0,7) associates a pair (a R ,^ R ) of parallel 
morphisms in a suitable category to any representation R of S. In a sense made precise 
later, we can ask whether 

a R < 7 R . 

If this is the case, we say that R satisfies the inequation E. 

A 2-signature is a pair [S, A) consisting of a 1-signature S, which specifies the terms 
of a language, together with a set A of S -inequations, which specifies reduction rules on 
those terms. 



3 Since we'll mostly talk about relative monads in this paper, we refer to the traditional monads as 
endomonads and reserve the term "monad" for relative monads. 



Given a 2-signature (S, A), we call representation of (S,A) any representation of S 
that satisfies each inequation of A. The category of representations of (S, A) is defined 
to be the full subcategory of representations of S whose objects are representations of 
(S,A). 

We would like to conjecture the existence of an initial object in the category of repre- 
sentations of (5, ^4) and hence must rule out inequations which are never satisfied. We 
call soft 5-inequation any S'-inequation whose codomain is of a particular form. Our 
main result states that for any set A of soft S-inequations the category of representations 
of (S, A) has an initial object. The class of soft inequations is large enough to account 
for all the reduction rules we are interested in; in particular, beta and eta reduction are 
given by soft inequations. 

Our definitions ensure that any reduction rule between terms that is expressed by an 
inequation E S A is automatically propagated into subterms. The set A of inequations 
hence only needs to contain some "generating" inequations, a fact that is well illustrated 
by our example 2-signature A/3 of the untyped lambda calculus with beta reduction. 
This signature has only one inequation /3 which expresses beta reduction at the root of 
a term, 

(XM)N < M[* := N] . 

The initial representation of A/3 is given by the untyped lambda calculus, equipped with 
the reflexive and transitive beta reduction relation -»p [BB94]. 

Related Work 

Initial Semantics results for syntax with variable binding were first presented on the 
LICS'99 conference. Those results were concerned only with the syntactic aspect of 
languages: they characterize the set of terms of a language as an initial object in some 
category, while not taking into account relations on terms. In lack of a better name, we 
refer to this kind of initiality results as purely syntactic. 

Some of these initiality theorems have been extended to also incorporate semantic as- 
pects, e.g. in form of equivalence relations between terms. These extensions are reviewed 
in the second paragraph. 

Purely syntactic results Initial Semantics for "pure" syntax — i.e. without considering 
semantic aspects -- with variable binding were presented by several people indepen- 
dently, differing in the modelisation of variable binding: 

Gabbay and Pitts' nominal approach [GP99, GP01, Pit03] uses a set theory enriched 
with atoms to establish an initiality result. Their approach models lambda abstraction as 
a constructor which takes a pair of a variable name and a term as arguments. In contrast 
to the other techniques mentioned in this list, in the nominal approach syntactic equality 
is different from a-equivalence. 

Hofmann [Hof99] and Miculan and Scagnetto [MS03] prove initiality results modelling 
variable binding in a Higher-Order Abstract Syntax (HOAS) style. 



Fiore et al. [FPT99] (also [Fio02, Fio05]) modify the classical notion of initial algebras 
by replacing sets by presheaves over finite sets, constructing fresh, to-be-bound vari- 
ables by a _+ 1 functor, corresponding to context extension, on those presheaves. Vari- 
able binding is modelled through nested datatypes as introduced by Bird and Meertens 
[BM98]. 

Later Tanaka and Power [TP05] generalize and subsume those three approaches to a 
general category of contexts. An overview of this work and references to more technical 
papers can be found in [Pow07] . 

Hirschowitz and Maggesi [HM07b] prove an initiality result for untyped syntax based 
on the notion of monads and modules over monads. Their work has been extended to 
simply-typed syntax by Zsido [ZsilO]. 

Incorporating Semantics Ghani and Liith [GL03] characterize equational theories (with 
a symmetry rule) resp. rewrite systems (with reflexivity and transitivity rule, but without 
symmetry) via a different universal property, namely as coequalizers resp. coinserters in 
a category of monads on Set resp. Ord. 

Fiore and Hur [FH07] have extended Fiore's work to integrate semantic aspects into 
initiality results. In particular, Hur's thesis [HurlO] is dedicated to equational systems 
for syntax with variable binding. Thus, in contrast to the approach described in this 
work, Hur has a symmetry rule for reductions. 

Hirschowitz and Maggesi [HM07b] prove initiality of the set of lambda terms modulo 
beta and eta conversion in a category of exponential monads. In an unpublished paper 
[HM07a] they introduce the notion of half-equation and equation to express equality 
between terms. 

In the same paper they mention the idea of using of preorders as a different approach 
to model operational semantics and they interpret the untyped lambda calculus with 
beta and eta reduction relation as an endomonad over the category Ord of preordered 
sets. 

In this paper we adopt both Hirschowitz and Maggesi's notion of half-equation as 
well as Ghani and Liith's idea of using preorders to model semantic aspects of syn- 
tax. According to the use of preorders, we consider a pair of parallel half-equations as 
an inequation, defining a relation, rather than as an equation. This emphasizes the dy- 
namic viewpoint of reductions as directed equalities rather than the static, mathematical 
viewpoint one obtains by considering symmetric relations. 

However, we consider not endomonads but instead relative monads as defined by 
Altenkirch et al. [ACU10], that is, monads with different source and target categories. 
The rationale behind this decision is that variables are unstructured sets, terms of a 
language carry structure in form of reduction relation. So it is reasonable to suppose 
variables and terms to live in different categories, which is possible through the use 
of relative monads instead of endomonads. The choice of A (cf. Def. 2) as underlying 
functor permits — via an adjunction expressed in Def. 2 — the use of any map - not 
necessarily monotone - for substitution, cf. Ex. 4. 

T. Hirschowitz [Hir], taking the viewpoint of Categorical Semantics, defines a category 



Sig of 2-signatures for simply-typed syntax with reduction rules, and constructs an 
adjunction between Sig and the category 2CCCat of small cartesian closed 2-categories. 
He thus associates to any signature a 2-category of types, terms and reductions verifying 
a universal property. More precisely, terms are modelled by morphisms in this category, 
and reductions are expressed by the existence of 2-cells between terms. Variable binding 
is modelled through exponentials, corresponding to an encoding of variable binding via 
Higher-Order Abstract Syntax (HOAS). 

Synopsis 

In the first section we review the definition of (relative) monads and define modules over 

those monads as well as their morphisms. Some constructions on monads and modules 

are given, which will be of importance in what follows. 

In the second section we define arities, half-equations and inequations, as well as their 

representations. Afterwards we state our main result. The running example in the first 

two sections is the 2-signature A/3 of the lambda calculus with beta reduction. 

In the third section we describe some elements of the formalization of the main theorem 

in the proof assistant Coq. 

Some conclusions and future work are stated in the last section. 

1 Relative Monads & Modules 

As explained in the introduction, we will associate a category of Semantics, or "models", 
to any signature. This section reviews and introduces the category-theoretic structures 
from which these Semantics are built: 

an object of the category consists of a relative monad (on the functor A, cf. Def. 
2) together with some extra data, namely a morphism in a suitable category for each 
arity of the signature. In particular, the Initial Semantics — the syntax — associated 
to a signature comes with its underlying monad. The examples 4 and 12 constitute the 
initial object in the category of Semantics of the lambda calculus with beta reduction, 
the running example of Sec. 2. 

1.1 Modules over Relative Monads 

We review the definition of relative monad as given by Altenkirch et al. [ACU10] and 
define suitable morphisms for them. As an example, we consider the lambda calculus 
as a relative monad from sets to preorders, on the functor A. The definitions and 
constructions of modules over endomonads as used by Hirschowitz and Maggesi [HM07b] 
easily carries over to relative monads. The definitions and constructions given here for 
modules over relative monads are analogous to those for modules over endomonads 
[HM10, ZsilO]. 

Definition 1 (Relative Monad, [ACU10]): Given categories C and T> and a functor 
F : C —7- T>, a (relative) monad P : C — > T> on F is given by the following data: 



• a map P: C — > T> on the objects of C, 

• for each object c of C, a morphism rj c £ T>{Fc, Pc) and 

• for all objects c and d of C a substitution map 

cr C)d : £>(Fc, Pd) -»• D(Pc, Pd) 

such that the following diagrams commute for all suitable morphisms / and g: 
Vc . „ „ „ er(/) 






The name "substitution map" is justified when e. g. C is instantiated with a suitable 
category of sets or contexts and PX is a set of terms with variables in the set X. The 
diagrams then express the well-known substitution properties [ACU10]. 
A monad P can be equipped with a functorial structure (also denoted by P) by setting, 
for a morphism / : a — > b in C, 

P(f):=]iit P (f):=a(r,oFf), 

the functoriality axioms being a consequence of the monad axioms. 
We are mainly interested in monads on the functor A: 

Definition 2 (A): We call A : Set — > Ord the functor from sets to preordered sets 
which associates to each set X the set itself together with the smallest preorder, i.e. the 
diagonal of X, 

A(X):=(X,A X ). 

The functor A is left adjoint to the forgetful functor, 

a x , Y ■ Ord(AX,y) ^ Set(X, UY) . 

Lemma 3 (Monads over A and Endomonads): Let P be a monad on A. By postcom- 
posing with the forgetful functor U : Ord —> Set we obtain an endomonad 

P:Set^Set . 

The substitution is defined, for f £ Set(a, UPb) by setting 

a p {f):=u(a{a-\f))) , 

making use of the adjunction f \-> a~ 1 (f) G Ord(Aa, Pb) of Def. 2. 

Conversely, to any endomonad Q : Set — > Set we associate a relative monad 



AQ : Set A Ord 



by postcomposing with the functor A. 



Example 4: Consider the set of all lambda terms indexed by their set of free variables 

LC(V) ::= Var : V -> LC(V) 

| Abs : LC(y') -»■ LC(V) 

| App : LC(F) x LC(y) -)• LC(V), 

where V := V + {*} is the set 1/ enriched with a new distinguished element, i.e. a 
context extended by one additional free variable. Altenkirch and Reus [AR99] interpret 
LC as an endomonad over sets. 

We equip each LC(V) with a preorder taken as the reflexive-transitive closure of the 
relation generated by the rule 

(3 : (XM)N < M[* := N] 

and its propagation into subterms. This defines a monad from sets to preorders 

LC/3 : Set A Ord . 

The family ?y LC is given by the constructor Var, and the substitution map 

a x ,Y : Ord(AX,LC/3(Y)) -)■ Ord(LC/3(X),LC/3(V)) 

is given by simultaneous subsitution. Via the adjunction of Def. 2 the substitution can 
also be read as 

a x ,Y ■ Set(X,LC(Y)) -»■ Ord(LC/?(X),LC/?(Y)) . 

The substitution can hence be chosen as for the endomonad LC, but one has to prove 
the additional property of monotonicity in the first-order argument. 

For two monads P and Q from C to T> a morphism of monads is a family of morphisms 
r c S T>(Pc, Qc) that is compatible with the monadic structure: 

Definition 5 (Morphism of Relative Monads): Given two relative monads P and Q 
from C to P on the functor F: C —> T>, a morphism of monads from P to Q is given by 
a collection of morphisms r c G T>{Pc, Qc) such that the following diagram commutes for 
any suitable morphism /: 

Pc > Pd Fc > Pc 



Qc 





<r Q (r d of) 



As a consequence from these commutativity properties the family r is a natural trans- 
formation between the functors induced by the monads P and Q. 



Monads over F : C — > T> and their morphisms form a category RMon(i ? ) where identity 
and composition of morphisms are simply defined by pointwise identity and composition 
of morphisms. 

Lemma 6 (Adjunction between RMon(A) and Mon(Set)): The maps defined in Lem. 
3 give rise to an adjunction between the category of monads over A and the category of 
endomonads over sets, 

a PtQ : Hom(AP, Q) ^ Hom(P, Q) . 

Given a monad P over F : C —> T>, the notion of module over P generalizes the notion 
of monadic substitution: 

rp 

Definition 7 (Module over Relative Monad): Let P: C — > T> be a relative monad and 
let £ be a category. A relative module M over P with codomain £ is given by 

• a map M : C — > £ on the objects of the categories involved and 

• for all objects c, d of C a map 

q c4 : V(Fc,Pd) -»• £{Mc,Md) 

such that the following diagrams commute for all suitable morphisms / and g: 

Mc > Md Mc 





Me ^ Mc 



A functoriality for such a module M can then be defined in the same way as for monads. 
The following examples of modules are instances of more general constructions ex- 
plained in the next section: 

Example 8: The map LC/3 : V *-> LC/?(V) yields a module over the monad LC/3, the 
tautological module LC/3. 

Example 9: Let V' := V U {*}. The map LC/3' : V i-+ LC/3(y') can be equipped with 
a structure of an LC/3-module, the derived module of the module LC/3 of Ex. 8. 

Example 10: The map V h-> LC/3(V) x LC/3(V) can be equipped with a structure of 
an LC/3-module. 

A module morphism is a family of morphisms that is compatible with module substi- 
tution: 



10 



Definition 11 (Module Morphism): Let M and N be two relative modules over P: C — >• 
T> with codomain £. A morphism of relative P-modules from M to iV is given by a 
collection of morphisms p c G £(Mc, Nc) such that for any morphism / E D(Fc, Pd) the 
following diagram commutes: 

Mc >Md 



Pc 



Pd 



The modules over P with codomain £ and morphisms between them form a category 
called RMod(P, £). Composition and identity morphisms of modules are defined by 
pointwise composition and identity, similarly to the category of monads. 

Example 12 (Exs. 8, 9, 10 cont.): Abstraction is a morphism of LC/3-modules, 

Abs : LC/3' ->• LC/3 . 

Similarly, application is a morphism of LC/3-modules, 

App : LC/3 x LC/3 -»■ LC,5 . 

1.2 Constructions on Monads and Modules 

The following constructions of modules are important in what follows. 
The first definition explain Ex. 8: 

Definition 13 (Tautological Module): Given a monad P on F : C — > T>, we define the 
tautological module (also denoted by P) over P to be the module (M, <r) := (P,a), i.e. 
with object map P and module substitution given by the monad substitution. Thus the 
monad P can be considered as an object in the category RMod(P, T>). 

Definition 14 (Constant and terminal module): Let P be a monad over F : C — > T>. 
For any object e £ £ the constant map T e : C — > £, c i— > e for all c G C is equipped 
with the structure of a P-module. In particular, if £ has a terminal object i£ , then the 
constant module c i— >■ l£ is terminal in RMod(P, £). 

Let P and Q be two monads on F : C — > T>. Given a morphism h : P — )■ Q of monads, 
we can turn any Q-module M into a P-module, by "pulling it back" along h: 

Definition 15 (Pullback module): Let h : P — > Q be a morphism of monads on F : C — > 
T> and let M be a Q-module with codomain £. We define a P-module /i*M to £ with 
object map c h* Mc by setting 

^M {f):=q M {hdQf) 

This module is called the pullback module of M along h. The pullback extends to module 
morphisms and is functorial. 
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Remark 16: The pullback P-module h*M has the same underlying functor as the 
Q-module M. It is merely the substitution action that changes: while s M expects 
morphisms in C(Fc,Qd) as arguments, the substitution of h*M expects morphisms in 
C(Fc,Pd). 

Definition 17 (Induced module morphism): With the same notation as before, the 
monad morphism h induces a morphism of P-modules h: P —> h*Q. 

Remark 18: Note that the preceding two constructions do not change the functor resp. 
natural transformation underlying the module resp. morphism of modules. 

The following construction explains Ex. 10: 

Definition 19 (Products): Suppose the category £ is equipped with a product. Let M 
and N be P-modules with codomain £ . Then the map (on objects) 

C ->■ £, c4 Mcx Nc 
is equipped with a module substitution by setting 

S MxN (f) := ? M (/) x , N (f) . 
This construction extends to a product on RMod(P, £). 

For our definition of half-equations in the next section, we need a category where 
modules over different monads are grouped together: 

Definition 20: Let C,D and £ be categories and F : C — > T> be a functor. We define 
the category LMod(P, £) ("L" for "large") to be the category whose objects are pairs 
(P, M), where R is a monad over F : C — >■ 2? and M is an P-module. A morphism from 
(P, M) to (S, N) is a pair (p, r) where p : P — > S is a monad morphism and r is an 
P-module morphism r : M — ?► p*N. 

We are particularly interested in monads over the functor A : Set —> Ord. The 
following construction — derivation — applies to modules over such monads. 

1.3 Derived Modules 

Roughly speaking, a binding constructor makes free variables disappear. Its input are 
hence terms in an extended context, i.e. with (one or more) additional free variables 
compared to the output. Derivation is about context extensions. 
Formally, given a set V (V for variables), we consider a new set 

V' :=V + {*} 

which denotes V enriched with a new distinguished element - the "fresh" variable. The 
map V i— > V can be extended to a monad on the category of sets and is hence functorial. 
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Given a map / : V —> W and w £ W, we call 

default(/, w):V'-*W 
the coproduct map denned by 

default(j»(t) := T' lf * = * ' 

1 JW \f(v), Ht = veV . 

Definition 21: Given a monad P on A : Set —> Ord and a P-module M with codomain 
£, we define the derived module by setting 

M'(V) := M(V') . 

For a morphism / G Ord(AV, PVF) the module substitution for the derived module is 
given by 

? M '(/):=? M (shift(/)) . 

Here the shifted map 

shift(/) G Ord(A(V'), P(W')) 

is defined as the image under the adjunction of Def. 2 of 

ay i iy(shift(/)) := default (P(z) o f,r](*)) , 

the map i : W — >■ W' being the inclusion map. 

Derivation extends to an endofunctor on the category of P-modules with codomain E. 

Remark 22: When P is a monad of terms over free variables, the map shift/ sends 
the additional variable of V to r/ p (*), i.e. to the term consisting of just the "freshest" 
free variable. When recursively substituting with a map / : V — > P(W), terms under a 
binder such as A must be substituted with the shifted map shift(/). 

Definition 23: Given a natural number n, we write M n for the module M derived n 
times. Given a list s = [n\, . . . , n m ] of natural numbers, we write M s := M ni x . . . x M nm . 
Product and derivation are functorial, and we use the same notation on morphisms. 
That is, given a morphism of P-modules p : M — > N, we write 

p s := p ni x . . . x p nm :M S -+N S . 

The pullback operation commutes with products and derivations : 

Lemma 24: Let C and T> be categories and £ be a category with products. Let P and 
Q be monads on F : C —> T> and p : P — >■ Q a monad morphism. Let M and N be 
P-modules with codomain E. The pullback functor is cartesian: 

p*(M x N) = p*M x p*N . 
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Lemma 25: Consider the setting as in the preceding lemma, with C = Set, T> = Ord 
and F := A. Then we have 

p*{M') = (p*M)' . 

Definition 26: We denote by wOrd the category whose objects are preordered sets and 
whose morphisms are simply maps of sets, not necessarily monotone. Given a monad 
P : Set —7- Ord, and a P-module M with codomain Ord, we can consider M as a 
P-module with codomain wOrd. We denote this module by M. 
For any set X, we define the substitution of just one variable, 

subst x : P(X') x P(X) -> P{X), (y, z) ^ y[* := z) := a (default (rjx, z)){y) . 

This defines a morphism of P-modules with codomain wOrd, 

subst p : P x P ^ P . 

Remark 27: Note that the substitution module morphism defined above is by construc- 
tion monotone in its first argument, but not in its second argument. This is the reason 
why we cannot consider subst as a morphism of P-modules 

subst p : P' x P ^ P , 

but have to switch to the "weaker" category wOrd. This fact and a way to ensure 
monotonicity also in the second argument are explained more generally in Rem. 56. 

2 2-Signatures and their Representations 

An arity describes the number of arguments and binding behaviour of a constructor of 
a syntax. A 1 -signature S is a family of arities and as such specifies a syntax. An 
inequation over S — also called S-inequation — expresses relations between the terms 
of the syntax associated to S. A 2-signature consists of a 1-signature S and a set of 
5-inequations. 

Representations of a 1-signature are defined analogously to [HM07b, ZsilO], except 
that we use relative monads and modules over such monads. 

Our inequations are precisely Hirschowitz and Maggesi's equations [HM07a], i.e. par- 
allel pairs of half-equations. We simply interpret such a pair to define a relation rather 
than an equality. Throughout this section we continue the running example of A/3. 

2.1 Arities, 1-Signatures and their Representations 

Definition 28 (Arity, 1-Signature): An arity is a list of natural numbers. A 1-signature 
is a family of arities. 

Example 29: The 1-signature A of the untyped lambda calculus is given by the two 
arities 

app:=[0,0] , abs := [1] . 
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Definition 30 (Representation of an Arity): Let s := [ni,ri2,--. ,n m ] be an arity and 
R be a monad over the functor A. A representation of the arity s in the monad R is an 
.R-module morphism 

s R :R s ^R . 

Note that we use the notation defined in Def. 23. We call 

domO, R):= R s = R ni x ...x R" m 

the domain module of the representation of s in R. 

Definition 31: A representation R of a 1-signature S is given by a monad P over the 
functor A and, for each arity s G S, a representation a of s in P. 

Given a representation i?, we denote its underlying monad by R as well. 

Remark 32: A representation of a 1-signature a la Hirschowitz and Maggesi [HM07b] 
is defined analogously, except for the use of endomonads on the category of sets instead 
of relative monads. 

Definition 33: To any representation R of a 1-signature S in a relative monad R as 
defined in Def. 31 we associate a representation R of S in the endomonad R in the sense 
of Zsido [HM07b] by postcomposing the representation module morphism of any arity s 
of S with the forgetful functor from preorders to sets: 

s R :R S -+R h+ ( s «) : (R>) = R s -». R . 

Conversely, to any representation Q of S in an endomonad Q over sets we can associate 
a representation AQ of S 1 in the relative monad AQ over A, by postcomposing the 
representation module morphisms with A. 

Example 34 (Ex. 29 continued): A representation R of the 1-signature A is given by 

• a monad R : Set — > Ord and 

• two morphisms of i?-modules in RMod(i?, Ord), 

app K :RxR^R and abs^ : R 1 -»■ i? . 

Morphisms of representations are monad morphisms which commute with the repre- 
sentation morphisms of modules: 

Definition 35 (Morphism of Representations): Let P and Q be representations of a 
signature S. A morphism of representations f : P — > Q is a morphism of monads 
/ : P — ?► Q such that the following diagram commutes for any arity s of S: 

s p 

pS yp 

r f 

f « w 
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Note that we make extensive use of the notation defined in Def. 23. To make sense 
of this diagram it is necessary to recall the constructions on modules of section 1.2. 
The diagram lives in the category RMod(P, Ord). The vertices are obtained from the 
tautological modules P resp. Q over the monads P resp. Q by applying the derivation and 
pullback functors as well as by the use of the product in the category of P-modules into 
Ord. The vertical morphisms are module morphisms induced by /, to which functoriality 
of derivation and products are applied. Furthermore instances of lemmas 24 and 25 
are hidden in the lower left corner. The lower horizontal morphism makes use of the 
functoriality of the pullback operation. 

Example 36 (Ex. 34 continued): Let P and R be two representations of A. A morphism 
from P to R is given by a morphism of monads / : P — > R such that the following 
diagrams of P-module morphisms commute: 

app p 



Px P >P 



fxf 



f 




f*(R x R) > R 

r(&PP R ) " /*(abs«) 

Definition 37 (Category of Representations): Representations of S and their mor- 
phisms form a category Rep (S). 

Lemma 38: The assignment of Def. 33 extends to an adjunction between our category 
of representations Rep (S) in monads over A and Hirschowitz and Maggesi's category 
Rep(S') of representations in endomonads over sets: 

a P ,Q : Rep A (5)(AP,Q) * Rep(S)(P,Q) . 

Note that the right adjoint is called U in the introduction, cf. Eq. (0.1). 

Lemma 39: Given a signature S, the category of representations of S in relative mon- 
ads on A has an initial representation. Its underlying monad associates to any set V of 
variables the set of terms of the language specified by S with free variables in V , equipped 
with the diagonal preorder. 

Proof. This is a direct consequence of the fact that left adjoints preserve colimits, thus, 
in particular, initial objects. □ 

Remark 40: The formalization in Coq of Lem. 39 (cf. Sec. 3.2) does not appeal to 
Hirschowitz and Maggesi's result, but constructs the initial object from scratch. 

2.2 Inequations over 1-Signatures 

Hirschowitz and Maggesi [HM07a] define equations to be pairs of half-equations to ex- 
press equalities between terms. We simply interpret a pair of half-equations as inequation 
rather than equation. 
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Definition 41 (Category of Half-Equations, [HM07a]): Let S be a signature. An S- 
module U is a functor from the category of representations of S to the category of 
modules LMod(A, wOrd) commuting with the forgetful functor to the category of relative 
monads over A. We define a morphism of S'-modules to be a natural transformation 
which becomes the identity when composed with the forgetful functor. We call these 
morphisms half -equations. These definitions yield a category which we call the category 
of S'-modules (or the category of half-equations). 
We sometimes write 

U§ := U(R)(X) 

for the value of an S-module at the representation R and the set X. Similarly, for a 
half-equation a : U — > V we write 

Remark 42: A half-equation a from S-module U to V associates to any representation 
P a morphism of P-modules a p : U(P) —> V(P) in RMod(P, wOrd) such that for any 
morphism of ^-representations / : P —> R the following diagram commutes. 



(P, U(P)) a > (P, V(P)) 



(f,u(f)) 




(f,v(f)) 



(R, f*(U(R))) > (R, f*(V(R))) 



Lemma 43: The category of S-modules is cartesian. 

Definition 44 (Algebraic S-module): We call algebraic any S-module verifying the 
following inductive predicate: 

• The map : R i— >• (R, R) is an algebraic S-module. 

• If the S-module M : R^ (Mi(i?) , M 2 (R)) is algebraic, so is 

M' :R^ {M 1 {R),M 2 {R) 1 ). 

• If M and TV are algebraic, so is 

M x N : R^ (M 1 (R),M 2 (R) x N 2 (R)). 

Using the same notation as in Def. 23, any list of natural numbers defines uniquely an 
algebraic S-module. 

Definition 45: The substitution operation 

subst : R ^ subst^ : R' x R ->■ R 
is a half-equation over any signature S. Its domain and codomain are algebraic. 
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Example 46 (Ex. 29 continued): The map 

R h. app R o(abs H , id K ) : R' x R ^ R 

is a half-equation over the signature A. 

Definition 47: Any arity s = [n\, . . . ,n m ] £ S defines an algebraic S'-module 

dom(s) :R^R S . 

Definition 48 (Inequation, 2-Signature) : Given a 1-signature 5, an S -inequation is a 
pair of parallel half-equations between S-modules. We write 

a< 7 : U -+V 

for the inequation (0,7) with domain U and codomain V. 

A 2-signature is a pair (S, A) of a 1-signature 5 and a set A of S'-inequations. 

Definition 49 (Representation of Inequations): A representation of an S -inequation 
ct < 7 : f7 — > V is any representation R oi S such that a < 7 pointwise, i.e. if for any 
set X and any y £ U(R)(X), 

a%(y) < 7x(y) • 

We say that such a representation i? satisfies the inequation a < 7. 

For a set vl of S'-inequations, we call representation of (5, A) any representation of 
S that satisfies each inequation of A. We define the category of representations of the 
2-signature (S, A) to be the full subcategory in the category of representations of S 
whose objects are representations of (S, A). 

Example 50 (Ex. 46 continued): We denote by j3 the A-inequation 

appo(abs,id) < subst . (/?) 

We call A/3 the 2-signature ((app, abs), j3). 
A representation P of A/3 is given by 

• a monad P : Set — > Ord and 

• two morphisms of P-modules 

app:PxP->P and abs : P' ->■ P 

such that for any set X and any y £ P(X') and z £ PX 

app x (ahs x {y),z) < y[* := z] . 



18 



2.3 Initiality for 2-Signatures 

Given a 2-signature (S, A), we would like to conjecture that its category of representa- 
tions has an initial object. However, we have to rule out inequations which are never 
satisfied, since an empty category obviously does not not have an initial object. We 
restrict ourselves to inequations with an algebraic codomain: 

Definition 51 (Soft Inequation): We say that an S*-inequation is soft if its codomain 
is algebraic. 

Theorem 52: For any set of soft S -inequations A, the category of representations of 
(S, A) has an initial object. 

Proof. The basic ingredients for building the initial representation are given by the initial 
representation AS in the category Rep (S) (cf. Lemma 39) or, equivalently, by the initial 
representation E in Rep(5). We call £ the endomonad underlying the representation £. 
The proof consists of 3 steps: at first, we define a preorder <a on the terms of 
£, induced by the set A of inequations. Afterwards we show that the data of the 
representation £ — substitution, representation morphisms etc. - - is compatible with 
the preorder <a in a suitable sense. This will yield a representation T,a of (5, A). Finally 
we show that £ A is the initial such representation. 
— The monad underlying the initial representation: 
For any set X, we equip T,X with a preorder A by setting, for x, y £ T,X, 

x< A y :^> V22:Rep(S,A), *Jl(aO <R *fl(v) , (2-1) 

where ir ; : £ — > R is the initial morphism of representations in endomonads coming from 
Zsido's theorem (or, equivalently, the initial morphism AS — > R). 
We have to show that the map 

I^S A X:=(EX,< A ) 

yields a relative monad over A. The missing fact to prove is that the substitution of the 
monad £ with a morphism 

/ G Ord(AX, £ A Y) £ Set(X, TY) 

is compatible with the order <a- given any / £ Ord(AX, T<aY) we show that a (/) € 
Set (EX, TiY) is monotone with respect to <a and hence (the carrier of) a morphism 
o~(f) G Ord(E^X, YiaY). We overload the infix symbol 3>= to denote monadic substi- 
tution. Suppose x <a y, we show 

x »= / < A y »= / • 

Using the definition of <a, we must show, for any representation R of (5, A), 
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Since i R is a morphism of representations, it is compatible with the substitutions of E 
and R; we have 

ir(x »= /) = i R (x) 3= i R o / . 

Rewriting this equality and its equivalent for y in the current goal yields the goal 

(r(x) »= iR o f < A i R (y) >= i R o / , 

which is true since the substitution of R (whose underlying map is that of R) is monotone 
in the first argument (cf. Rem. 56) and i R {x) < R i R {y) by assumption. We hence have 
defined a monad Y,a over A. 

We interrupt the proof for an important lemma: 

Lemma 53: Given an algebraic functor V : Rep (S) — > LMod(A,wOrd) from the 
category of representations in monads on A to the large category of modules over such 
modules with codomain category wOrd ; we have 

x< A ye V(T,)(X) & VR : Rep(S, A), V(i R )(x) < v r V(i R )(y) , 

where now and later we omit the argument X, e.g. in V{i R ){X){x). 

Proof of Lemma 53. The proof is done by induction on the derivation of "V algebraic". 
The only interesting case is where V = M x N is a product: 

(xi,yi) < (x 2 , 2/2) <^ x\ < x 2 A y 1 < y 2 

&VR,M(i R )(xi) < M(i R )(x 2 )AVR,N(i R )(y 1 ) < N(i R )(y 2 ) 
&VR,M(i R )( Xl ) < M(i R )(x 2 )AN{i R )( yi ) < N(i R )(y 2 ) 
&VR,V(i R )(xi, yi ) < V{i R ){x 2 ,y 2 ) . 

D 

— Representing S in S^: 

Any arity s € S should be represented by the module morphism s s , i.e. the representation 
of s in E. We have to show that those representations are compatible with the preorder 
A. Given x <a y in dom(s, E)(X), we show (omitting the argument X in s s (X)(x)) 

s s (x) < A s s (y) . 

By definition, we have to show that, for any representation R as before, 

i R {s*{x)) < R i R {s*{y)) . 

Since i R is a morphism of representations, it commutes with the representational module 
morphisms - - the corresponding diagram is similar to the diagram of Def. 35. By 
rewriting with this equality we obtain the goal 



s R 



((dom(a)(t fl ))(aO) < R s R ((dom(s)(i R ))(y)) . 
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This goal is proved by instantiating Lemma 53 with the algebraic S-module dom(s) (cf. 
Def. 47) and the fact that s R is monotone. We hence have established a representation 

— which we call S^ - of S in S^. 

— Tja verifies A: 

The next step is to show that the representation Y,a verifies A. Given an inequation 

a<7: U -+V 

of A with an algebraic S'-module V, we must show that for any set X and any x E 
U(Y>a){X) in the domain of a we have 

al A {x) < A ^{x) . (2.2) 

In the following we omit the subscript X. By Lemma 53 the goal is equivalent to 

VR:Re V (S,A), V(i R )(a^(x)) < v n V(i R )(^ A (x)) . (2.3) 

Let R be a representation of (S, A). We continue by proving (2.3) for R. 
By Remark 42 and the fact that i R is also the carrier of a morphism of S'-represen- 
tations from AS to R (cf. Lemma 38) we can rewrite the goal as 

a R (U{i R ){x)) < v n 7 K ([/(^)(zO) , 

which is true since R verifies A. 

— Initiality o/XU-' 

Given any representation R of (S, A), the morphism i R is monotone with respect to the 
orders on Y,a and R by construction of <a- It is hence a morphism of representations 
from Y>a to R. 

The unicity of the morphisms i R follows from its unicity in the category of represen- 
tations of S, i.e. without inequations. Hence Y,a is the initial object in the category of 
representations of (S,A) . □ 

Example 54 (Ex. 50 continued): The only inequation of the signature A/3 is soft. The 
initial representation of A/3 is given by the monad LC/3 together with the LC/3-module 
morphisms Abs and App (cf. Ex. 12) as representation structure. 

2.4 Some Remarks 

We conclude this paragraph with some remarks about "generating inequalities", substi- 
tution properties and simply-typed syntax: 

Remark 55 (about "generating" inequations): Given a 2-signature (S, A) and a rep- 
resentation R of S, the representation morphism of modules s of any arity s of S is 
monotone. For the initial representation of (5, A) this means that any relation between 
terms of S which comes from A is automatically propagated into subterms. Similarly, 
the relation on those terms is by construction reflexive and transitive, since we consider 
representations in monads with codomain Ord. 



21 



For the example of A/3 this means that in order to obtain the "complete" reduction 
relation -»g, it is sufficient to specify only one inequation for the /3-rule 

(XM)N < M[* := N] . 

Remark 56 (about substitution): The substitution in Ex. 4 is compatible with the 
order on terms in the following sense: 

1. M < N implies M[* := A] < N[* := A] and 

2. A < B implies M[* := A] < M[* := B). 

The first implication is a general fact about relative monads over A: for any such monad 
P and any / G Ord(V, PW), the substitution (Tx,yU) £ Oid(PV, PW) is monotone. 

The second monotony property, however, is false for a monad over A in general. As 
an example, consider the monad given by 

F(V) ::= Var : V ->• F(V) 

I J- = F(V) 

| (=>) . F{ y) -> F(F) -». F(F) , 

equipped with a preorder which is contravariant in the first argument of the arrow 
constructor (=r-). It is then clear that substituting in this position, the first argument of 
(=>), does in fact reverse the order on terms, i.e. we obtain (using =^> infixed) 

A < B implies (* => M)[* := B] < (* => M)[* := A] . 

A different definition of monad which would enforce implication 2 to hold — and hence 
not include the example F — can be given easily by considering Ord as a 2-category: 
given morphisms /, g : Ord(V, W) we say that there is precisely one 2-cell 

f=>g iff f<g iff Vv : V, f(v) < g(v) . 

A monad P would then have to be equipped with a substitution action that is given, for 
any two sets V and W, by a functor (of preorders) 

oyyv : Ord(AV, PW) ->• Ord(PV, PW) . 

Remark 57 (about typed syntax): The initiality theorem generalizes to the simply- 
typed syntax, such as the typed lambda calculus TLC/3 over a set of types T ::= * | T =>■ 
T. In this case representations are in monads 

P:[T, Set] [ ^ ] [T,Ord] , 

or more generally, in a monad over a set of types U, provided there is a "translation of 
types" f:T^U, 

Q : [U, Set] ^ [U, Ord] . 

In addition to the derivation functor the theorem makes use of a family of fibre functors 

Fib(t) : RMod(P, [T, Ord]) ->• RMod(P, Ord) 

which serve to pick out terms of an object type t £ T. This extension to simply-typed 
syntax will be reported elsewhere. 
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3 Formalization in the proof assistant Coq 

In this section we explain some elements of the formalization of the initiality theorem 
in the proof assistant Coq. However, we only explain the implementations of definitions 
and lemmas that are specific to the theorem. We base ourselves on a general library of 
category theoretic concepts the formalization details of which we do not go into. The 
interested reader can find an in-depth description and the complete Coq code on the 
author's web page. The implementation of categories, endomonads and modules over 
endomonads (which are analoguous to the implementation of their relative counterparts 
used here) is explained in detail in [AZ11]. 

For a morphism / from object a to object b in any category we write f : a > b. 

Composition of morphisms / : a — > b and g : b — > c is written f ;; g. 

3.1 Category of Representations of a 1-Signature 

We start with the formal definition of 1-signature, cf. Def. 28: 

Notation "[ T ]" := (list T) (at level 5). 
Record Signature : Type := { 

sig_index : Type ; 

sig : sig_index — > [nat] }. 

Next we formalize context extension according to a natural number, cf. Sec. 1.3. These 
definitions are important for the definition of the module morphisms we associate to an 
arity, cf. below. Context extension is actually functorial: 

Fixpoint pow (n : nat) (V : TYPE) : TYPE : = 

match n with 

| => V 

| S n' => pow n' (option V) 

end. 
Notation "V ** I" := (pow I V) (at level 10). 

Fixpoint pow_map (I : nat) VW (f : V > W) : 

V ** I > W ** I : = 

match I return V ** I > W ** I with 

| 0=> f 

| S n' => pow_map (~ f) 

end. 

The product module of a module M over monad P associated to a list of natural 
numbers (cf. Defs. 21, 23) consists of products of "terms" of the module M over contexts 
that are extended according to the list of naturals. The relation on the product is 
induced by that on M. We omit the construction of the product module substitution 
and the proof of the module properties thereof: 

Variable M : RModule P PO. 

Inductive prod_mod_c (V : TYPE) : [nat] — > Type : = 
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I TTT : prod_mod_c V nil 
I CONSTR : forall b bs, 

M (V ** b)— > prod_mod_c V bs — > prod_mod_c V (b::bs). 
Inductive prod_mod_c_rel (V : TYPE) : forall n, relation (prod_mod_c M V n) : = 
| TTT_rel : forall x y : prod_mod_c M V nil, prod_mod_c_rel x y 
| CONSTR_rel : forall n I, forall x y : M (V ** n), 
forall a b : prod_mod_c M V I, 
x << y — > prod_mod_c_rel a b — > 
prod_mod_c_rel (CONSTR x a) (CONSTR y b). 

Definition prod_mod I := Build_RModule (prod_mod_struct I). 

Note that the infixed "<<" is overloaded notation and denotes the relation of any pre- 
ordered set. 

To any arity ar : [nat] and a module M over a monad P we associate a type of module 
morphisms modhom_from_arity ar M. Representing ar in M then means giving a term of 
type modhom_from_arity ar M. Note that in the corresponding Def. 30 we have defined 
representations in monads only. Indeed we instantiate M with the tautological module 
later. 

Definition modhom_from_arity (ar : [nat]) (M : RModule P PO): Type := 
RModule_Hom (prod_mod M ar) M. 

For the rest of the section, we suppose a representation S to be given via a Coq section 
variable. 

As just mentioned, representing the signature S in a monad P (cf. Def. 31) means 
providing a suitable module morphism for any arity of S, i.e. providing, for any element 
of the indexing set sig_index S, a term of type modhom_from_arity P (sig i): 

Definition Repr (P : RMonad SM_po) := forall i : sig_index S, 

modhom_from_arity P (sig i). 
Record Representation := { 

rep_monad :> RMonad SM_po ; 

repr : Repr rep_monad }. 

Note that the projecton rep_monad is declared as a coercion by using the special syntax 
:>. This coercion allows for abuse of notation in Coq as we do informally according to 
Def. 31. See the first paragraph of Sec. 3.3 for a use of this abuse. 

The definition of morphism between two representations of S (cf. Def. 35) necessitates 
the definition of the two sides of the commutative diagram, for any arity of S. 

We first define, more generally, the commutative diagram for an arity a and represen- 
tations RepP in a monad P and RepQ in a monad Q: 

Variable a : [nat]. 

Variable RepP : modhom_from_arity P a. 
Variable RepQ : modhom_from_arity Q a. 
Definition commute left : = 
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Prod_mor a ;; f* RepQ . 
Definition commute_right := RepP ;; PbRMod_ind_Hom f . 
Definition commute := commute_left == commute_right. 

Using the preceding definition, we define morphisms of representations of S: 

Class Representation_Hom_struct (P Q : Representation) (f : RMonad_Hom P Q) 
repr_hom_s : forall i : sig_index S, 

commute f (repr P i) (repr Q i). 
Record Representation_Hom : Type := { 
repr_hom_c :> RMonad_Hom P Q; 
repr_hom :> Representation_Hom_struct repr_hom_c }. 

Afterwards we define the category REP of representations of S, cf. Leni. 37: 

Program Instance REPRESENTATION_struct : 

Cat_struct (@Representation_Hom S) := { 
mor_oid a c := eq_Rep_oid a c; 
id a := Rep_ld a; 
comp P Q R f g := Rep_Comp f g }. 

Definition REP := Build Cat REPRESENTATION struct. 



3.2 Initial Representation of a 1-Signature 

While the initial representation of a 1-signature can be obtained via the adjunction of 
Lem. 38, we construct it directly. 

The carrier of the initial object is given by a mutually inductive Coq data type. 

Inductive UTS (V : TYPE) : TYPE := 

| Var : V -> UTS V 

| Build : forall (i : sig_index Sig), UTSJist V (sig i) -> UTS V 
with 
UTSJist (V : TYPE) : [nat] -> Type : = 

| TT : UTSJist V nil 

| constr : forall b bs, UTS (V ** b) -> UTSJist V bs -> UTSJist V (b::bs). 

We then define renaming of variables, ie. functoriality, as a mutually recursive function: 

Fixpoint rename (V W: TYPE ) (f : V > W) (v : UTS V):= 

match v in UTS _ return UTS W with 
| Var v => Var (f v) 

| Build i I => Build (I // f) 

end 
with 

list_rename V t (I : UTSJist V t) W (f : V > W) : UTSJist W t := 

match I in UTSJist _ t return UTSJist W t with 
I TT => TT W 
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I constr b bs elem elems => 

constr (elem //— ( f ^^ b)) (elems // f) 

end 
where "x //— f" := (rename f x) 
and "x // — f" := (list_rename x f). 

Analoguously to renaming, substitution is denned by mutual recursion. The helper 
function Ishift is used to adapt substitution maps f : V > UTS W to extended con- 
texts, e.g. Ishift f : V ' ' I > UTS (W ^^ I). The number I of additional variables 

is an implicit argument of Ishift: 

Fixpoint subst (V W : TYPE) (f : V > UTS W) (v : UTS V) : 

UTS W := match v in UTS _ return UTS _ with 
| Var v => f v 

| Build i I => Build (I »==f) 
end 
with 

list_subst V W t (I : UTSJist V t) (f : V > UTS W) : UTSJist W t : = 

match I in UTSJist _ t return UTSJist W t with 

| TT => TT W 

| constr b bs elem elems => 

constr (elem >== (Jshift f)) (elems >>== f) 
end 
where "x >== f" := (subst f x) 
and "x >>== f" := (list_subst x f). 

Finally we obtain a relative monad UTSM equipped with the pointwise diagonal pre- 
ordering, which becomes the underlying monad of a representation UTSRepr of S. 

In the following we suppose R to be an arbitrary representation of the signature S. 
The (carrier of) the initial morphism is defined by mutual recursion: 

Fixpoint init V (v : UTS V) : R V : = 

match v in UTS _ return R V with 
| Var v => rweta (RMonad_struct := R) V v 
J Build i X => repr R i V (initjist X) 
end 
with 
initjist I (V : TYPE) (s : UTSJist V I) : prod_mod R I V : = 
match s in UTSJist _ I return prod_mod R I V with 
I TT => TTT 

| constr b bs elem elems => CONSTR (init elem) (initjist elems) 
end. 

Some lemmas then establish that init is the carrier of a morphism of monads, 

Lemma initjift V x W (f : V > W) : init (x //- f) = rlift R f (init x). 

Lemma initjdeisli V (v : UTS V) W (f : SM_po V > UTS_sm W) : 

init (v >== f) = rkleisli (f ;; @init_sm _ ) (init v). 
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and, in fact, of a morphism of representations, named init rep. The last lemma estab- 
lishes unicity of init rep: 

Lemma init_unique :forall f : UTSRepr > R , f == init rep. 

3.3 Inequations and Initial Representation of a 2-Signature 

For a 1-signature S, an S-module is defined to be a functor from representations of 
S to the category whose objects are pairs of a monad P and a module M over P, cf. 
Def. 41. We don't need the functor properties, and use dependent types instead of the 
cumbersome category of pairs, in order to ensure that a representation in a monad P is 
mapped to a P-module. 

The below definition makes use of two coercions. Firstly, we may write a : C because 
the "object" projection of the category record (whose definition we omit) is declared as 
a coercion. Secondly, the monad underlying any representation can be accessed without 
explicit projection using the coercion we mentioned above. 

Class S_Module_s (Tau : forall R : REP S, RMOD R wPO) := { 

S_Mod_Hom : forall (R S : REP S) (f : R > S), 

Tau R > PbRMod f (Tau S) }. 

Record S_Module := { 
s_mod :> forall R : REP S, RMOD R wPO ; 
s_mod_hom :> S_Module_s s_mod }. 

A half-equation is a natural transformation between S-modules. We need the natu- 
rality condition in the following. Since we have not formalized S-modules as functors, 
we have to state naturality explicitly: 

Class half_equation_struct (U V : S_Module) 

(half_eq : forall R : REP S, s_mod U R > s_mod V R) := { 

comm_eq_s : forall (R S : REP S) (f : R > S), 

S_Mod_Hom (S_Module_s := U) f ;; PbRModJHom _ (half_eq S) == 
half_eq R ;; S_Mod_Hom (S_Module_s := V) f }. 

Record half_equation (U V : S_Module) := { 

half_eq :> forall R : REP S, U R > V R ; 

half_eq_s :> half_equation_struct half_eq }. 

We now formalize algebraic S-modules. Any list of natural numbers specifies an 
algebraic S'-module, cf. Def. 44. Given a list of naturals codl, we call this S'-module 
S_Mod_alg codl. We call algebraic half-equation any half-equation with an algebraic 
codomain, and an algebraic inequation is a pair of parallel algebraic half-equations (cf. 
Def. 51): 

Definition half_eq_alg (U : S_Module)(codl : [nat]) := 

half_equation U (S_Mod_alg codl). 
Record eq_alg := { 



27 



Dom : S_Module ; 

Cod : [nat] ; 

eql : half_eq_alg Dom Cod ; 

eq2 : half_eq_alg Dom Cod }. 

Give a representation P and an (algebraic) inequation e, we check whether P satisfies 
e by pointwise comparison (cf. Def. 49). 

Definition verifies_eq (e : eq_alg) (P : REP S) := 
forall c (x : Dom e P c), 

eql x<< eq2 x. 

(* for a family of inequations indexed by a set A *) 

Definition Prop_Sig (A : Type) := A — > eq_alg. 
Definition verifies_psig A (T : Prop_Sig A) (R : REP S) : = 
forall a, verifies_eq (T a) R. 

Suppose given a set A of inequations (formally, a family of inequations indexed by 
a type A) over the signature S. The category of representations of (S, A) is obtained 
as a full subcategory of the category of representations of S. The following declara- 
tion produces a subcategory from predicates on the type of representations and on the 
(dependent) type of morphisms of representations, yielding the category PROP_REP of 
representations of (S,A): 

Variable A : Type. 
Variable T : Prop_Sig A. 

Program Instance Prop_Rep : SubCat_compat (REP S) 
(fun P => verifies_psig T P) (fun a b f => True). 
Definition PROP_REP : Cat := SubCat Prop_Rep. 

We now construct the initial object of PROP_REP. The relation on the initial object 
is defined precisely as in the paper proof, cf. Eq. (2.1). 

Definition prop_rel_c X (x y : UTS S X) : Prop := 

forall R : PROP_REP, init (FINJ _ R) x << init (FINJ _ R) y. 

Here, FINJ _ R denotes the representation R as a representation of S, i.e. the injection 
of R in the category REP S of representations of S. 

The relation defined above is indeed a preorder, and we define the monad UTSP to 
be the monad whose underlying sets are identical to UTSM, namely the sets defined by 
UTS, but equipped with this new preorder. This monad UTSP is denoted by E^ in the 
paper proof. 

The representation module morphisms of the initial representation UTSRepr can be 
"reused" after having proved their compatibility with the new order, yielding a repre- 
sentation UTSProp. An important lemma states that this representation verifies the 
inequations of T: 

Lemma UTSPRepr_sig_prop : verifies_psig T UTSProp. 

We have to explicitly inject the representation into the category of representations of 
(S,A): 
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Definition UTSPROP : PROP_REP : = 

exist (fun R : Representation S => verifies_psig (A:=A) T R) UTSProp 
UTSPRepr_sig_prop. 

For building the initial morphism towards any representation R : PROP_REP, we first 
build the corresponding morphism in the category of representations of S: 

Definition init_prop_re : UTSPropr > (FINJ _ R) := ... 

which we then inject, analoguously to the initial representation, into the subcategory of 
representations of (S,A): 

Definition init prop : UTSPROP > R := exist _ (init_prop_re R) I. 

Finally we obtain our Thm. 52: an initial object of a category is given by an object Init 

of this category, a map associating go any object R a morphism InitMor R : Init > R, 

and a proof of uniqueness of any such morphism. 

Program Instance INITIAL_PROP : Initial PROP_REP := { 
Init := UTSPROP ; 

InitMor := init prop ; 

InitMorUnique := init_prop_unique }. 



4 Conclusions & Future Work 

We have presented an initiality result for abstract syntax which integrates semantics 
specified by reduction rules by means of preorders. It is based on relative monads and 
modules over such monads. Reduction rules are specified by inequations, whose definition 
we borrow from [HM07a]. For any 2-signature (S, A) with soft inequations we prove the 
existence of an initial object in the category of representations of (S,A). The theorem 
is proved formally in the proof assistant Coq. 

We hope that this work is a first step to a framework for reasoning about abstract 
syntax in a mathematical, category-theoretic setting. 

On another line of work [Ahrll] we are extending Zsido's initiality result [ZsilO, Chap. 
6] to allow for varying object types. In this way initiality accounts for compilation 
between programming languages over different types. 

Both lines of work, varying object types and the integration of semantics, can be 
combined (cf. Rem. 57): in a forthcoming work we will prove an initiality result which 
allows for specification of reductions as well as change of object types. As an example, 
we consider the language PCF with its usual small-step semantics. By equipping LC/3 
with a representation of PCF, we obtain a translation of PCF to LC which is faithful 
w.r.t. semantics. 

Our approach should be extended to more complicated type systems such as depen- 
dent types and polymorphism. Another interesting feature to work on are conditional 
reductions/rewritings . 
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